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Executive  Summary 

In  this  report,  we  provide  a  theoretical  proof  and  an  experimental  verification  of  a  new  approach  to 
invariant  automatic  target  recognition  (ATR)  proposed  by  Physical  Optics  Corporation  (POC).  The 
core  of  the  concept  is  to  determine  the  so-called  archetype  image  for  a  class  of  all  images  derived 
from  a  given  object  through  scale,  translation  and  rotational  (STR)  changes.  This  archetype  image 
will  then  represent  the  whole  class  of  STR  transformations  and  can  be  used  for  further 
identification.  We  started  with  a  comparative  analysis  of  the  moments  transformation  which  is  the 
closest  related  approach.  Next,  we  developed  an  algorithm  for  bringing  an  input  image  to  its 
archetype  position.  We  tested  this  algorithm  in  the  presence  of  added  noise  and  varying 
illumination.  We  also  confirmed  the  robustness  of  the  algorithm  performance  experimentally  by 
using  an  optical  correlator  to  classify  the  obtained  archetype  images.  At  the  end  of  this  report  we 
outline  the  proposed  ATR  system  development 
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1.0  INTRODUCTION 

The  goal  of  this  research  is  to  develop  a  new  concept  of  scale,  translation,  and  rotation  invariant 
optical  processing  for  ATR.  Research  on  invariant  ATR  has  been  underway  for  the  last  thirty  to 
forty  years,  motivated  by  various  military  and  commercial  appUcations,  the  latter  including  on-line 
control,  robotic  vision  and  biomedical  analysis  (such  as  used  in  cytopathology).  The  general 
requirements  for  invariant  ATR  have  been  developed  over  the  years  and  can  be  stated  as: 

-  Invariant  mapping  of  an  input  image  onto  a  feature  space 

-  Unique  classification  and/or  identification. 

A  fully  invariant  system  would  perform  ATR  without  regard  to  translation  (shift),  rotation,  dilation 
(scale),  and  illumination  or  viewing  angle.  In  this  project,  we  limit  our  investigation  to  the  first 
three  invariant  features  only,  although  we  suggest  methods  for  incorporating  the  other  two. 

The  principle  of  invariant  ATR  can  be  reduced  to  a  block  diagram  shown  in  Figure  1-1.  The 
transforms  take  the  form  of  Fourier,  Gabor,  windowed,  wavelet  or  some  other  integral 
transformations.  Matching  can  be  done  either  by  using  a  prestored  filter  (matched  filter)  or  a 
continuously  updated  (joint  transform  correlator)  filter.  Final  identification  is  typically  done  by 
choosing  some  distance  function  in  a  feature  space  such  as  Euclidean,  Mahalanobis,  or  minimum 
entropy. 


Figure  1-1 

Block  diagram  of  ATR. 


Figure  1-2  shows  a  hierarchy  of  images  of  increasing  complexity,  beginning  with  simple  2-D 
binary  objects  at  the  left  and  increasing  in  complexity  to  the  right.  The  procedure  illustrated  in 
Figure  1-1  has  not  been  fully  successful  for  realistic  3-D  objects  whose  increased  image 
complexity  can  be  described  by  Figure  1-2. 
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Figure  1-2 

Image  complexity  string. 


We  propose  to  reduce  the  image  complexity  described  in  Figure  1-2  by  following  the  steps  in 
Figure  1-3  which  correspond,  box  for  box,  to  Figure  1-2. 


Figure  1-3 

Reducing  image  complexity. 


In  Phase  I,  we  concentrated  on  the  nonlinear  optical  processing  and  electronic  identifier  stages, 
which  correspond  to  the  analysis  of  2-D  gray  level  objects  (or  2-D  projections  of  3-D  objects). 
The  problem  of  variable  illumination  and  shadowing  can  be  overcome  through  the  use  of  passive 
IR  imaging.  The  problem  of  3-D  ATR  includes  out-of-plane  rotations  which  constitute  different 
objects  and,  thus,  cannot  be  related  to  each  other  by  any  transformation  unless  full  knowledge  of 
the  object  is  available.  To  identify  an  object  from  any  of  its  projections,  all  of  the  possible 
projections  must  be  stored  as  training  patterns.  This  large  memory  storage  requirement  is  reduced 
by  the  compression  inherent  in  the  proposed  STRI  processor. 
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2.0  PREVIOUS  WORK  ON  INVARIANT  PATTERN  RECOGNITION 

In  this  section,  we  describe  advances  made  in  STRI ATR  to  date. 

Pattern  recognition  is  of  interest  in  many  disciplines,  since  it  can  provide  automatic  recognition  of 
signals  and  objects.  Optical  pattern  recognition,  in  particular,  will  provide  tools  for  machine 
vision,  robotics,  automation  and  image  understanding,  by  taking  advantage  of  the  parallel 
processing  capabilities  of  optics. 

One  of  the  main  demands  placed  on  a  recognition  system  is  that  it  be  invariant  to  changes  in  input, 

i.e.,  that  it  enable  object  recognition  or  classiHcation,  even  when  the  object  is  distorted  due  to 
changes  in  position,  rotation  or  size. 

Even  digital  ATR  is  a  challenging  task.  There  is  no  universal  approach  that  can  recognize  targets  in 
any  circumstances.  The  key  issue  is  the  feature  extraction  that  determines  the  pattern  recognition. 
Human  beings  easily  determine  the  most  significant  features  of  images.  Machines  nuy  not  be  able 
to  adaptively  choose  the  feature  extraction  algorithm  that  is  suitable  to  the  circumstance. 

We  evaluated  the  performance  of  the  following  algorithms  of  invariant  feature  extraction  and 
pattern  recognition: 

1 .  Orthogonal  Fourier- Mellin  Moments  (OFMMs); 

2.  Fourier  Descriptors  (FDs); 

3.  Geon^trical  approach  to  multiple  circular  harmonic  filters  (CHFs); 

4.  Phase-only  Synthetic  Discriminate  Function  (SDF)  filters; 

5.  Composite  wavelet  matched  filters; 

6.  Fourier-Mellin  Descriptors  (FMDs); 

7.  Image  normalization  using  low-order  Fourier-Mellin  moments; 

8.  Moment  invariants; 

9.  Phase-only  covariance  circular  harmonic  filters; 

1 0.  Wavelet  matched  filters. 

We  performed  computer  experiments  using  those  algorithms.  The  conclusion  of  our  investigation 
was  that  OFFMs  are  superior  over  other  approaches.  However,  this  method  is  computationally 
intensive  and  does  not  provide  sufficient  noise  tolerance  since  high  order  moments  are  involved. 
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2.1  Orthogonal  Fourier>Meilin  Moments  (OFMMs) 

OFMMs  are  numerical  features  describing  an  image.  OFMMs  are  scale,  rotation,  shift  and 
intensity  invariant.  They  are  highly  condensed  image  features,  useful  for  image  classification  in 
feature  space  12]. 

The  advantage  of  this  approach  is  that  an  image  and  all  its  rotated  and  scaled  versions  can  be 
described  and  classified  by  a  small  number  of  OFMM  features.  The  OFMMs  provide  a  high 
degree  of  data  compression.  They  are  orthogonal.  Hiere  is  no  information  redundancy  among  the 
OFMM  features. 

The  circular  Fourier  and  radial  Mellin  transform  (FMT)  is  defined  in  a  polar  coordinate  system 
(r,0)  as 


2jr  1 

Ms.m  =  J  I  r*  f(rt0)  exp  r  dr  d0  ,  (2-1) 

0  0 

where  f(r,0)  is  the  image  and  m  =  0,  ±1,  ±2, ...  is  the  circular  harmonic  order.  By  definition,  the 
Mellin  transform  of  order  s  is  complex.  With  integer  s  >  0,  Eq.  (2-1)  defines  the  Fourier-Mellin 
moments  (FMMs),  also  called  the  rotational  moments.  In  the  FMMs,  the  set  of  circular  Fourier 
basis  functions  {exp  (-jm0)}  is  orthogonal  but  the  set  of  radial  monomials  (r*)  is  not  orthogonal. 
The  radial  orders  of  the  FMMs  are  highly  correlated,  resulting  in  information  redundancy.  To 
make  the  radial  moments  orthogonal,  we  replace  the  kernel  function  r®  of  the  FMMs  by  a  set  of 
polynomials  Qn(r)  in  r  of  degree  n.  The  set  of  Qn(r)  is  orthogonal  with  respect  to  a  weight  function 
r  over  the  range  0  <  r  <  1 : 


J Qn  (r)  Qk  (r)  r  dr  =  an  6nk  ,  (2-2) 

0 

where  8nk  is  the  Kronecker  symbol,  an  is  a  normalization  constant,  and  r  =  1  is  the  maximum  size 
of  the  objects  that  can  be  encountered  in  a  particular  application.  The  orthogonal  radial 
polynomials  rather  than,  for  example,  the  Fourier  transform  basis  are  used,  because  the  radial 
moments  can  be  normalized  to  be  invariant  under  scale  changes,  as  will  be  shown  below. 
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The  OTthogonal  Fourier-Mellin  moments  (OFMMs)  are  defined  as 

1  2k  1 

=  - -  f  f  f  (r,0)  Q„  (r)  exp  (-jme)  r  dr  dB  .  (2-3) 

1C  ^  * 

0  0 

and  are  integrable  when  the  degree  of  Qn(r).  n  ^  0.  TTie  basis  functions  (ii(r)  exp(-jm0)  of  the 
OFMMs  are  orthogonal  over  the  interior  of  the  unit  circle. 

The  orthogonal  polynomials  Qn(r)  are  obtained  by  applying  the  Schmidts  process  of 
orthogonalization  to  a  sequence  of  natural  powers  of  r  over  the  range  0  <  r  ^  1 

1,  r,  r2,  r”  .  (2-4) 


By  using  Jacobi  polynomials  Gn(p,q,r),  we  obtain  the  polynomials  Qn(r)  corresponding  to  the 
Jacobi  polynomials  with  p  =  q  =  2: 

Qn  (r)  =  (-l)"  ‘)0„  (2,2,r)=  X“«s  •  »-5) 

s=0 


with 


=  r-n"  +  s  {n  +  s  +  \) 

(n  -  s) !  s !  (s  1)! 


(2-6) 


The  normalization  constant  in  Eq.  (2-2)  is  an  =  l/[2(n  +  1)].  The  explicit  expressions  of  Qn(r) 
are: 


Qo(r)  =  l 

Q,(r)  =  -(2-3r) 

Q2(r)  =  3-12r  +  10r2 

Q3  (r)  =  -  (4  -  30r  60r^  -  35r^) 

Q4  (r)  =  5  -  60r  -t-  210r^  -  280r^  -i-  \26t^ 

Q5  (r)  =  -  (6  -  105r  SbOr^  -  260r^  +  1260t^  -  462r^) 
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Substituting  Eqs.  (2-1)  and  (2-5)  into  Eq.  (2-3),  we  express  the  OFMMs  as  linear  combinations  of 
the  FMMs 


n  + 1  " 

^sm  •  (2-7) 

”  s=0 

The  OFMMs  can  be  normalized  to  be  invariant  under  rotation,  scale,  and  intensity  changes,  as  we 
will  show  below. 

The  FMMs  and  the  OFMMs  can  also  be  defined  in  the  (Cartesian  coordinate  system  as  modified 
complex  moments  (CMs)  and  orthogonal  CMs,  respectively.  This  is  important  because,  in  most 
cases,  images  are  presented  in  a  (Dartesian  coordinate  system.  Calculating  the  OFMMs  directly  in 
the  Cartesian  coordinate  system  can  avoid  the  coordinate  transform  of  the  image  from  the  C^artesian 
to  the  polar  coordinate  system. 

The  orthogonal  basis  functions  Qn(r)  exp  (-jm6)  of  the  OFMM  can  be  expressed  as  complex 
polynomials  in  (x  +  jy)  and  (x  -  jy).  The  CMs  are  defined  in  both  the  Cartesian  and  the  polar 
coordinate  system 


^  pq = J  J  jy)**  ~  jy)**  ^y 

— oo 

2,1 

=  J  J  f(r,0)  r^  exp  (-jmS)  r  dr  d0 
0  0 

with  s  =  p  +  q  and  m  =  q  -  p.  By  the  definition  of  Abu-Mostafa  and  Psaltis,  the  orders 
p  and  q  are  non-negative  integers.  From  Eqs.  (2-5)  and  (2-7), 

n  n 

Qn  (0  exp  (-jm0)  =  ^  r®  exp  (-jm0)  =  (x  +  jy)P  (x  -  jy)**  ,  (2-9) 

s=0  s=0 


are  a  set  of  complex  polynomials  of  degree  n  =  p  +  q,  where 

s-m  .  s+m 

p  = -  and  q  = - 

2  ^2 


(2-10) 
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orthogonal  inside  a  unit  circle.  Hence,  the  OFMM  can  be  expressed  as  a  linear  combination  of  the 
CMs  as 


^nm  “  ^  ^  s-m  s  +  m  (2-11) 

^  s=0 

where  the  orders  and  CMs,  Cpq,  are  real-valued  and  can  be  negative-valued. 

Under  the  conditions  that  both  p  +  q  =  s  and  q  -  p  =  m  are  integers  and  s  =  p  +  q  >  0,  the 
modified  CMs  are  convergent  and  integrable. 

In  the  following  experiments,  the  OFMMs  will  be  calculated  as  the  orthogonal  CMs  expressed  in 
Eq.  (2-11). 


2.2  Procedure  for  Computing  the  OFMM  Features 

The  steps  in  computing  the  OFMMs  of  an  image  follow. 

1 .  Determine  the  center  of  the  image  by  using  the  first  order  of  the  geometrical  moments  of  the 
image: 


xo 


mio 

*"00 


mpi 

yo  =  — ^ 

*^00 


where  the  geometrical  moments  mpq  are  defined  as 


mpq=  JJf(x,y)xP  y**  dxdy  . 


(2-12) 


(2-13) 


With  the  center  of  the  image  as  the  origin  of  the  coordinate  system,  the  moments  calculated 
in  this  coordinate  system  would  be  shift  invariant. 

2.  Calculate  the  FMMs,  Msm,  defined  in  Eq.  (2-1),  by  calculating  the  corresponding 
modified  complex  moments  Cpq  described  in  Eq.  (2-8)  with  p  =  “““  and  q  = 

in  the  (Dartesian  coordinate  system; 
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3.  Normalize  the  FMMs  for  the  scale  and  illumination  invariance.  When  the  object  f(r,0)  is 
scaled  by  a  factor  k  and  its  intensity  is  changed  by  a  factor  g,  its  FMMs  become 


27tl 


-  nl^S  +  2 


Msm  =  J  J  gf  (r/k,  0)  r*  rdr  d0  =  gk 
0  0 


M 


sm 


(2-14) 


The  scale  and  intensity  factors  k  and  g  are  obtained  using  the  low  order  FMMs: 


Mqo  =  gk  M(X) 
Mio  =  gk^  Mjo 


(2-15) 


The  scale  factor  k  is  equal  to 


k  = 


^  MiO  ^ 

Mqo 


Mqo 


(2-16) 


The  intensity  factor  g  is  equal  to 


^Mio' 


Mqo 


'  Mio  ^ 

<Moo 


Moo 


V^oo; 


(2-17) 


Using  the  factors  k  and  g,  the  FMMs  may  be  normalized  as 


_ 


gk 


s+2 


(2-18) 


The  normalized  FMMs  are  scale  and  intensity  invariant 

4.  Calculate  the  OFMMs  using  Eq.  (2-7)  and  the  normalized  FMMs.  A  rotation  of  the  image 
by  an  angle  0o  results  in  the  same  phase  factor  exp  (jm0o)  for  all  the  Msm  in  Eq.  (2-7). 
The  modulus  of  the  OFMM,  » is  rotation  invariant. 
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4.  Calculate  the  OFMMs  using  Eq.  (2-7)  and  the  normalized  FMMs.  A  rotation  of  die  image 
by  an  angle  0o  results  in  the  same  phase  factor  exp  (jm6o)  for  all  the  Msm  in  Eq.  (2-7). 
The  modulus  of  the  OFMM,  ,  is  rotation  invariant. 

2.3  Conclusions  Regarding  the  Use  of  OFMMs 

Although  superior  to  other  known  STRI  algorithms,  OFMM-based  ATR  suffers  from 

-  low  noise  tolerance  (especially  if  it  is  introduced  far  from  the  center  of  the 
image  frame) 

-  computational  intensity  (for  a  large  number  of  moments) 

-  no  optical  (parallel  processing)  implementation. 


3.0  PROOF  OF  STRI  PREPROCESSING  ALGORITHM 

POC's  proposed  STRI  ATR  concept  eliminates  the  main  deficiencies  of  previous  algorithms:  the 
proposed  algorithm  is  robust  and  noise  tolerant  and  ready  for  optical  implementation  in  a  standard 
coherent  optical  system. 

The  proposed  idea  enhances  existing  optical  correlation  based  ATR  by  introducing  scale, 
translation  and  rotation  invariance  (STRI).  Previously,  STRI  was  achieved  by  utilizing  a  library  of 
matching  filters  corresponding  to  various  positions  of  rotation  and  scale.  We  find  that  simple 
preprocessing  can  eliminate  the  need  for  the  library  since  any  STR  changes  in  the  input  image  can 
be  nullified  by  bringing  the  input  image  to,  as  we  call  it,  its  archetype.  This  means  that  any  STR 
transformation  of  the  target  can  be  reduced  to  the  initially  defined  target  archetype.  The  archetype 
fully  determines  the  whole  class  of  STR  transformed  images  of  the  given  target. 

This  archetype  image  is  characterized  by  the  following  features; 

•  Its  center  of  gravity  is  coincident  with  a  predetermined  origin. 

•  It  has  a  certain  angle,  specific  to  a  given  image,  with  the  horizontal  and  vertical 
axes. 

•  It  always  has  the  same  predetermined  scale. 
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Translation  invariance  is  achieved  the  same  way  as  in  moment  transforms.  Namely,  for  a  given 
image  I(x,y),  the  center  of  gravity  (xc,  yc)  can  be  found  by  using  the  following  expressions: 


/|xl(x,y)dxdy 

X  =« _ 

JJI(x,y)dxdy 

Q 

Ji  yl(x.y)dxdy 

V  =  Q _ 

JJI(x,y)dxdy 

n 


where  Q  is  the  area  of 
inside  the  input  frame. 


an  input  frame.  The  assumption  is  made  that  the  input  image  is  contained 


Then,  by  shifting  the  input  image  by  a  vector 


yc 


jx'  =  x-Xc 

ly’  =  y-yc 


(3-2) 


we  bring  the  image  to  the  origin.  (x',y')  are  the  new  coordinates  of  the  image. 

The  most  striking  aspect  of  the  proposed  STRI  concept  is  the  rotation  invariance  property.  In 
order  to  prove  it,  let  us  consider  a  function  \|f(6)  that  is  obtained  as  a  result  of  the  integration  of  the 
input  image  in  polar  coordinates: 


R2n 

¥(0o)=/  /  cos0I(r,  0-0Q)rd0dr,  (3-3) 

0  0 


where  R  is  the  radius  of  the  smallest  circle  covering  the  input  frame. 


By  changing  the  variable  to  <p  =  0  -  0o  and  using  the  formula  for  cos  (tp  +  0o), 

R  27r  R  2n 

\|/(0o)  =  cos  00  j  /costp  I(r,<p)drrd(p+ sin0o  J  Jsintp  I(r,(p)rdrd<p  .  (3-4) 

0  0  0  0 

If  we  call  the  first  double  integral  A  and  the  second  B,  then  the  short-hand  notation  of  Eq.  (3-4)  is 

\|/(0o)  =  A  cos  00  -  B  sin  0o  ,  (3-5) 
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or 


Vo(®o)  =  cos  00  cos  a  -  sin  0o  sin  a  =  cos(0o  +  a) 


(3-6) 


where 


cos  a  =  A/(A2  +  B2)1/2  ,  sin  a  =  B/(A2  +  B2)1/2  ,  yg  =  y/(A2  +  b2)1/2  .  (3-7) 


Clearly,  in  the  interval  from  0  to  2ji,  Vo(®o)  has  a  single  minimum,  Vo(®o)  =  *1.  which  is 
achieved  at 


00  =  Jt  -  a 


(3-8) 


regardless  of  the  rotation  of  the  input  image.  Thus,  by  computing  the  angle  0o  (0  <  0o  ^  2jt)  by 


00  =  JC  -  arctan  (A/B)  , 


(3-9) 


where 


R27t 

A  =  J  /cos9  I(r,(p)rdr  dtp 
0  0 
R2n 

B  =  J  Jsin  <p  I(r,<p)rdr  dtp 
0  0 


(3-10) 


and  rotating  the  input  image  by  0o,  the  input  image  is  brought  to  the  archetypal  angular  position. 

The  last  step  is  to  normalize  any  changes  in  scale.  This  is  achieved  by  computing  the  input  image 
normM: 


M=  JJI(x,y)dxdy  ,  (3-11) 

Q 

and  changing  the  coordinates  by  a  scaling  factor  Vm  .  This  will  assure  that  the  input  image  will  be 
normalized,  regardless  of  its  initial  scale.  Indeed,  if  the  input  image  is  zoomed  by  a  factor  c,  then 
its  new  norm  is 


Me  =  //I(cx,cy)dxdy  =  M/c^  , 
Q 


(3-12) 
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and  changing  the  coordinates  by  a  scaling  factor  Vm  .  This  will  assure  that  the  input  image  will  be 
normalized,  regardless  of  its  initial  scale.  Indeed,  if  the  input  image  is  zoomed  by  a  factor  c,  then 
its  new  norm  is 


Me  =  //l(cx,cy)dxdy  =  M/c^  ,  (3-12) 

Cl 

and  by  changing  the  input  image  scale  by  a  factor  of  VMc  ,  we  again  bring  the  image  to  the  same 
scale 


I(cx  •  VMc  ,  cy  •  VMc)  =  I(x  .  M,  y  •  M)  , 


(3-13) 


which  is  an  image  archetype,  since  M  is  constant  for  a  given  image. 


4.0  DIGITAL  IMAGE  ROTATION 

The  STRI  transformation  expressed  by  Eqs.  (3-2),  (3-9)  and  (3-12)  has  been  successfully  tested  at 
POC. 

POC's  image-processing  system  consists  of  a  Cohu  CCD  camera  connected  to  a  Matrox  PIP  video 
digitizer  board  installed  in  an  IBM-compatible  486/66  personal  computer.  A  black  and  white  TV 
monitor  was  used  to  display  the  images.  The  image-processing  station  is  shown  in  Figure  4- 1 . 
The  digitizer  board  is  supplemented  with  a  library  of  C-language  commands  which  were  used  to 
write  the  image-processing  routines.  The  following  is  a  summary  of  the  tasks  performed  by  these 
routines: 


12 


Final  Rpt  0394  3225  SDI  ATR 
N00014  93-C  0135 


Figure  4-1 

POC's  image  processing  station. 


First,  the  image  of  the  object  obtained  by  the  CCD  camera  is  digitized  into  a  256  x  256  array  of 
8-bit  pixels.  This  image  is  then  thresholded  to  set  the  background  and  shadows  to  zero.  At  this 
point,  the  center  of  gravity  of  the  object  is  calculated  with  Eq.  (3-1). 

The  object  is  then  moved  such  that  the  center  of  the  image  area  coincides  with  the  center  of  gravity 
of  the  object.  The  translation  accuracy  has  an  error  not  larger  than  one  pixel,  which  results  from 
rounding  up  the  numbers  computed  with  Eq.  (3-1). 

The  next  task  is  to  calculate  the  angle  through  which  the  object  is  to  be  rotated  to  reach  its 
archetype.  This  angle,  6(),  is  calculated  with  the  following  equation: 

9()  =  7t  -  arc  tan  (A/B) 

R271  R2jt 

where  A=j  j  cos({)  I(r,(t))r  dr  d<J)  ,  B=j  j  sintj)  l(r,0)r  dr  dtj)  .  (3-14) 

0  0  0  0 

The  rotation  task  presents  problems  not  associated  with  simple  translation.  Image  rotation  is  a 
geometric  process  where  one-to-one  mapping  of  source  to  destination  pixels  is  not  guaranteed. 
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The  rotation  task  presents  problems  not  associated  with  simple  translation.  Image  rotation  is  a 
geometric  process  where  one-to-one  mapping  of  source  to  destination  pixels  is  not  guaranteed. 
Without  this  one-to-one  correspondence  between  source  and  destination  pixels,  it  cannot  be 
guaranteed  that  one  source  pixel  will  be  mapped  to  every  destination  pixel.  This  creates  voids  that 
make  the  rotated  images  unusable.  We  employ  reverse  mapping  to  prevent  these  voids.  Reverse 
mapping  traverses  the  destination  image  space,  a  pixel  at  a  time,  and  calculates,  via  the  rotation 
transformation,  which  pixels  of  the  source  image  would  be  involved  in  producing  the  destination 
pixel.  When  the  value  of  the  destination  pixel  is  calculated  in  this  manner  and  placed  into  the 
destination  image,  complete  coverage  of  the  destination  image  is  guaranteed. 

The  other  problem  associated  with  rotation  is  that  hr^tionai  pixel  addresses  may  be  calculated  by 
the  rotation  transformation  equations.  This  results  in  the  calculated  source  image  pixel  lying 
somewhere  between  four  valid  pixel  locations  in  the  source  image,  again  posing  the  problem  of  a 
lack  of  one-to-one  pixel  mapping.  We  first  used  the  nearest  neighbor  approximation,  where  the 
fractional  address  of  the  source  pixel  is  translated  to  the  nearest  integer  pixel  address.  This, 
however,  resulted  in  blocky  and  jagged  images  which  were  less  than  perfect.  The  solution  to  this 
problem  was  to  use  the  interpolation  method  shown  in  Figure  4-2. 


Pixel  Columns 


Ar 

T _ T 

L 

r 

■ 

Px 

■ 

c*- 

1 

Transformed  Pixel 

Position  after  Rotation 

Source  image  Destination  Image 

Figure  4-2 

Interpolation  method. 


Rotation  Transfomration 
Equations: 

X’ »  X  cos  00  +  y  sin  00 
y'  -  y  cos  00  -  X  sin  0o 


The  four  points  surrounding  the  point  being  calculated  (Px)  are  referred  to  as  the  neighborhood. 
Linear  interpolation  assumes  that  the  contribution  of  a  pixel  in  the  neighborhood  varies  linearly 
with  its  distance  from  a  given  point.  Since  we  are  interpolating  in  two  dimensions,  we  perform 
three  interpolations  for  the  intensity  contribution  from  pixels  A  and  B,  the  contribution  from  pixels 
C  and  D,  and  the  contribution  from  AB  and  CD. 
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The  final  problem  associated  with  image  rotation  is  the  aspect  ratio  consideration.  Because  the  TV 
monitor  is  not  square,  the  pixels  are  not  square.  Rotation  of  nonsquare  pixels  results  in  distorted 
images.  This  problem  was  solved  by  using  appropriate  horizontal  and  vertical  scale  factors.  The 
combination  of  reverse  mapping,  linear  interpolation,  and  aspect  ratio  correction  results  in  the  best 
possible  rotated  images. 

Two  experiments  were  performed  to  test  the  operation  of  the  image  processing  station.  In  the  first 
experiment,  the  image  of  an  object  was  thresholded,  translated  to  the  center,  and  rotated  through  its 
calculated  characteristic  angle.  The  original  image  of  the  object  was  then  rotated  through  a  random 
angle  in  the  computer,  and  the  resultant  image  was  processed  through  the  above  routines.  The  two 
final  images  were  compared  and  found  to  be  identical.  In  the  second  experiment,  the  object  was 
located  in  two  different  positions  and  the  program  brought  it  to  its  archetype  position.  This 
experiment  also  produced  successful  results. 


5.0  EXPERIMENTAL  PROOF  OF  STRI  ALGORITHM 

5.1  STRI  Algorithm  Demonstration 

A  set  of  3-D  objects,  airplane  models,  was  used  to  demonstration  the  STRI  properties  of  the 
proposed  algorithm  (see  Figure  5-1). 
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Figure  5-1 

A  set  of  models  used  for  the  STRI  experiment. 


For  every  model,  25  arbitrary  STR  transformations  were  performed  and  mean  square  errors 
(MSEs)  were  calculated  for  the  resulting  archetypes.  The  system  (see  Figure  4-1)  operates  in  real¬ 
time.  The  results  for  two  arbitrary  target  positions  was  displayed  in  less  than  3  seconds. 

The  error  for  the  two  arbitrary  STR  transformed  images  as  they  are  brought  to  the  archetype 
position  mainly  occurs  at  the  edges,  as  seen  in  Figure  5-2.  Two  randomly  scaled  images  of  the 
same  model  displaced  from  the  center  of  the  frame  and  rotated  are  shown  at  the  top  of  the  screen 
while  one  archetype  (bottom  left)  and  the  difference  between  the  two  archetypes  is  shown  at  the 
bottom  right  of  the  screen. 
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Figure  5-2 

Example  of  STRI  processing  performed  on  two  airplane  models. 
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The  MSE  of  the  STRI  processing  for  the  four  models  shown  in  Figure  5-2  is  summarized  in 
Table  5- 1 .  The  processor  demonstrated  robust  performance  over  a  wide  range  of  scale  changes 
(300%),  full  360°  rotation,  and  all  possible  target  translations  within  the  image  frame.  The 
resulting  archetypes  are  invariant  and  can  be  used  to  represent  the  whole  class  of  the  STR 
transformations  of  the  object. 


Table  5-1  MSE  of  STRI  Processor 


Model 

MSE  (%) 

A 

3.1 

B 

2.1 

C 

0.9 

D 

1.7 

5.2  Noise  Tolerance  and  Illumination  Invariance 

The  STRI  processor  was  tested  for  both  noise  tolerance  and  illumination  invariance.  Two  low 
contrast  objects  (letters)  were  introduced  into  the  imaged  scene  to  stimulate  additive  external  noise. 

Figures  5-3  through  5-5  illustrate  the  fact  that  the  STRI  system  can  successfully  operate  under  low 
contrast  noise  conditions.  Each  pair  of  photos  shows  the  same  model  subjected  to  a  random  STR 
transformation  and  then  returned  to  the  same  archetype  position  (shown  in  the  insert).  In  addition, 
the  images  in  Figure  5-6  were  taken  under  different  illumination  conditions  and  the  larger  image  is 
slightly  out  of  focus. 
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Figure  5-3 

Performance  of  the  STRI  processor  on  an  image  scene  with  low  contrast  external  noise.  Model  A 
IS  used  here.  The  insert  shows  the  system's  reduction  of  the  input  model  to  the  archetype. 
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Figure  5-4 

Same  as  Figure  5-3.  Model  C. 
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Figure  5-5 

Same  as  Figure  5-3.  Model  B.  In  addition,  the  illumination  level  is  different  for  the  two  images. 
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6.0  OPTICAL  IMPLEMENTATION 


The  developed  STRI  digital  processor  can  be  used  to  supply  an  invariant  image  archetype  as  an 
input  to  an  ATR  system.  Specifically,  standard  optical  correlators  fit  very  well  with  the  STRI 
algorithm  that  has  been  implemented  in  the  Phase  1  work.  The  schematic  diagram  of  the  resulting 
ATR  system  is  shown  in  Figure  6-1. 


Lens  SLM  Lens  Array 


Figure  6-1 

Schematic  of  the  optical  implementation  of  the  STRI  ATR  system. 


As  was  mentioned  earlier,  only  one  matched  filter  is  needed  for  every  target  to  accommodate  all 
possible  in  plane  STR  transformations.  This  is  in  sharp  contrast  to  the  standard  optical  correlator 
where  target  rotations  (typically  of  few  degrees)  and  scale  changes  (usually  larger  than  5%)  must 
be  recorded  as  a  separate  matched  filter  (or  combined  into  a  synthetic  filter).  The  photograph  in 
Figure  6-2  shows  a  hardware  implementation  of  an  ATR  system  demonstrated  at  P(X;.  For  this 
system,  the  previously  demonstrated  digital  processor  operates  as  a  preprocessor  for  the  optical 
correlator.  Since  the  matched  filter  for  the  archetype  of  the  given  target  is  recorded,  any  STR 
transformed  image  of  the  object  will  trigger  the  correlation,  resulting  in  a  strong  correlation  peak 
(see  Figure  6-3).  In  this  experiment,  a  DCG  based  matched  filter  accommodated  a  pair  of  targets 
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a  pair  of  targets  (Models  B  and  C)  through  angular  multiplexing.  They  were  presented  to  the 
correlator  using  a  liquid  crystal  spatial  light  modulator  (SLM),  as  shown  in  Figure  6-4. 


Figure  6-2 

POC's  correlator  combined  with  the  STRI  preprocessor. 
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Figure  6-3 

Auto  correlation  and  cross  correlation  peaks  observed  at  the  system  output. 
(The  bottom  line  seen  on  the  oscilloscope  is  the  intensity  scan.) 
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Model  B 


Model  C 
Figure  6-4 

Images  of  the  model  as  they  appeared  on  the  input  SLM  after  preprocessing. 
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Then,  the  STR  transfonned  images  trigger  the  corresponding  matched  filter  to  produce  a 
correlation  output,  as  is  seen  in  Figure  6-5.  Thus,  the  preprocessor  developed  in  this  project 
makes  the  correlator  fully  STR  invariant. 


Figure  6-5 

Correlation  output  from  the  multiplexed  matched  filter  for  the  two  target$  shown  in  Figure  6-4. 


7.0  CONCLUSION 

The  operating  principle  of  the  developed  STRI  system  is  summarized  in  the  flow  chart  in 
Figure  7-1. 


26 


Final  Rpt  0394.3225  SOI-ATR 
N00014-93-C-013S 


Archetype  of  Input  Image 


Figure  7-1 

Diagram  of  STRI  preprocessor  operation. 

The  proposed  concept  is  a  significant  step  in  the  development  of  ATR  systems,  since  it 

•  introduces  complete  factorization  of  the  input  space 

•  provides  full  (theoretically  unlimited)  STR  invariance 

•  does  not  require  computation  of  higher  order  moments  which  negatively 
affect  noise  tolerance 

•  minimizes  matched  filter  loads. 
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8.0  PHASE  II  SUGGESTIONS 

The  proposed  ATR  system  will  use  STRI  wavelet  transformation  (WT)  as  a  feature  extractor  and  a 
neural  network  (NN)  as  a  final  classifier.  The  direct  classification  by  the  NN  (without  data 
compression)  would  fail  because  of  the  large  number  of  input  neurons  required.  This  number 
equals  the  number  of  pixels  in  the  input  picture.  The  WT  representation  is  an  excellent  candidate 
for  data  compression.  However,  in  order  to  accommodate  all  possible  STR  transformations,  the 
NN  should  store  a  very  large  number  of  weight  matrices  corresponding  to  every  distinct  target 
pose.  This  again  makes  this  approach  less  practical.  The  STRI  feature  extractor  allows  the  whole 
class  of  STR  transformations  of  the  given  target  to  be  represented  by  a  single  set  of  wavelet 
coefficients.  It  greatly  reduces  the  NN  classifier's  load.  The  proposed  ATR  system  can  be 
practically  implemented  following  the  diagram  in  Figure  8-1.  While  image  acquisition  and 
preprocessing  use  existing  hardware  and  software,  the  innovative  part  of  the  proposed  ATR 
system  is  the  STR  invariant  WT  feature  extractor.  Notice  that  we  do  not  perform  any 
transformation  on  the  input  image.  In  the  digital  implementation  of  the  STR  invariant  WT  concept, 
we  need  only  compute  values  of  the  new  basic  wavelet  functions  at  the  nodes  of  the  old  pixel  grid. 
Starting  from  an  analytical  expression  for  the  new  function,  our  feature  extractor  preserves  all  the 
information  given  in  the  input  image  and  involves  no  additional  approximation  error. 


//\ 


Figure  8-1 

STR  invariant  WT  for  ATR. 
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